Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 0603720080140010037
Journal of Korean Society of Medical Informatics
2008 Volume.14 No. 1 p.37 ~ p.44
Comparisons of predictive modeling techniques for breast cancer in Korean women
Lee Sun-Mi

Abstract
Objective : To develop breast cancer prediction models and to compare their predictive performance by using Bayesian Networks (BN), Naive Bayes (NB), Classification and Regression Trees (CART), and Logistic Regression (LR).

Methods : The dataset consisting of 109 breast cancer patients and 100 healthy women was used. Hugin ResearcherTM 6.7 and Poulin?Hugin 1.5, both of which are NB modeling software, were used. For the LR model and CART, ECMiner was used.

Results : The highest area under the receiver operating characteristic curve (AUC) was shown in the Tree augmented NB model as .90. The lowest AUC was CART with .48; that of the LR model was .86. Two BN models with prior knowledge and without prior knowledge did not show any difference at all (.64 vs. .65). The lifts of four models (Simple NB, Tree Augmented NB, Hierarchical NB, LR) were 1.9. The AUCs in both the NB and LR models were higher than that of the previously established models that have been published by using LR methods.

Conclusion : NB could be preferred to LR in the development of a predictive model to promote regular screening tests and early detection, which is more or less free from statistical assumptions and limitations. (Journal of Korean Society of Medical Informatics 14-1, 37-44, 2008)
KEYWORD
Bayesian Network, Naive Bayes, CART, Logistic Regression, Cancer, Risk Assessment, Predictive Model
FullTexts / Linksout information
 
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI) KoreaMed ´ëÇÑÀÇÇÐȸ ȸ¿ø